Advertisement detection in French broadcast news using acoustic repetition and Gaussian mixture models
نویسندگان
چکیده
In this paper, we detect advertisements in French broadcast news by locating both repeated and non-repeated ads. The non-repeated ads are located by using Gaussian mixture models (GMMs) to discriminate between program and ad segments. The repeated ad detection stage first uses features generated by a symmetric KL2 metric to locate repeated 5-sec audio segments. These repeated segments are then verified and extended through a detailed matching algorithm that uses cepstral features. The proposed repeated advertisement detection algorithms detect repeated audio reliably, resulting in 33.2% advertisement detection error rate (AER). The 26.0% missed ads are due to ads not being repeated, while the 7.2% false alarms are due to short repeated segments in the program. Using GMMs to classify repeated segments as program or ad reduces the AER to 30.1%. To locate non-repeated ads in program segments, we divide the audio between these repeated ads into short segments, and classify each segment as a program or an ad using these GMMs. This reduces the AER from 30.1% to 13.0%. We improve the segment boundaries between programs and ads by Viterbi alignment. This re-alignment reduces the AER from 13.0% to 10.6% (96.7% recall and 93.0% precision). Overall, we detect 87% of the non-repeated ads.
منابع مشابه
Transcribing broadcast news shows
While significant improvements have been made over the last 5 years in large vocabulary continuous speech recognition of large read-speech corpora such as the ARPA Wall Street Journal-based CSR corpus (WSJ) for American English and the BREF corpus for French, these tasks remain relatively artificial. In this paper we report on our development work in moving from laboratory read speech data to r...
متن کاملSpoken Term Detection for Persian News of Islamic Republic of Iran Broadcasting
Islamic Republic of Iran Broadcasting (IRIB) as one of the biggest broadcasting organizations, produces thousands of hours of media content daily. Accordingly, the IRIBchr('39')s archive is one of the richest archives in Iran containing a huge amount of multimedia data. Monitoring this massive volume of data, and brows and retrieval of this archive is one of the key issues for this broadcasting...
متن کاملTranscribing Broadcast News: The LIMSI Nov96 Hub4 System
In this paper we report on the LIMSI Nov96 Hub4 system for transcription of broadcast news shows. We describe the development work in moving from laboratory read speech data to realworld speech data in order to build a system for the ARPA Nov96 evaluation. Two main problems were addressed to deal with the continuous flow of inhomogenous data. These concern the varied acoustic nature of the sign...
متن کاملTranscribing Broadcast News: The LIMSI Nov96
In this paper we report on the LIMSI Nov96 Hub4 system for transcription of broadcast news shows. We describe the development work in moving from laboratory read speech data to realworld speech data in order to build a system for the ARPA Nov96 evaluation. Two main problems were addressed to deal with the continuous flow of inhomogenous data. These concern the varied acoustic nature of the sign...
متن کاملThe LIMSI 1997 Hub-4E Transcription System
In this paper we report on the LIMSI system used in the Nov’97 Hub-4E benchmark test on transcription of American English broadcast news shows. There are two main differences from the LIMSI system developed for the Nov’96 evaluation. The first concerns the preprocessing stages for partitioning the data, and the second concerns a reduction in the number of acoustic model sets used to deal with t...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2008